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Abstract 
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Channel (MISO-BC) subject to general linear constraints is considered. Such constraints include, as 
special cases, the sum power, the per-antenna or per-group-of-antennas power, and "forbidden interference 
direction" constraints. We consider both the optimal dirty-paper coding and the simple suboptimal linear 
zero-forcing beamforming strategies, and provide numerically efficient algorithms that solve the problem 
in its most general form. As an application, we consider a multi-cell scenario with partial cell cooperation, 
where each cell optimizes its precoder by taking into account interference constraints on specific users in 
adjacent cells. The effectiveness of the proposed methods is evaluated in a simple system scenario 
including two adjacent cells, under different fairness criteria that emphasize the bottleneck role of 
users near the cell "boundary". Our results show that "active" Inter-Cell Interference (ICI) mitigation 
outperforms the conventional "static" ICI mitigation based on fractional frequency reuse. 
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I. Problem setup and summary of the results 

The next generation of wireless communication systems (e.g., 802.16m [1], LTE-Advanced |2}) is 
expected to capitalize on the large spectral efficiency gains promised by multiuser MIMO communications. 

The fundamental information-theoretic model for the downlink of a cellular system comprising one 
base-station equipped with M transmit antennas and K user terminals, each one with one or more 
receiving antennas, is the well-known Gaussian MIMO Broadcast Channel (MIMO-BC) (3J-Q- m tnis 
work we restrict to the case of single-antenna receivers, commonly referred to as the Multi-Input Single- 
Output Broadcast Channel (MISO-BC). One channel use of the MISO-BC is described by 

y k = h"x + z k , k = l,...,K (1) 

where h k G C denotes the channel vector of user k, x G C M is the transmitted signal vector and 
z k ~ CjV(0, 1) is a complex circularly symmetric additive white Gaussian noise (AWGN). 

Let S denote a compact set of M x M covariance matrices. The capacity region of the MISO-BC ([T} 
subject to the input constraint E[xx H ] = S x G S is given by the set of rate vectors R G E; 

C bc (5;H) = coh |J |J 

(2) 

where the M x K channel matrix H = [hi, ... , hx] collects all user channel vectors. The capacity region 
Cbc(5; H) is achieved by Gaussian Dirty-Paper Coding (DPC), where the permutation 7r = (tt\, . . . , -kk) 
of the user indices {1, . . . , K} indicates the successive encoding order where user m is encoded first 
and user ttk is encoded last. The transmit covariance matrix is given by Yl x = Ylk=i v fc v fe9fe an( ^ ^ i s 
defined by the unit-norm "steering vectors" {v^} and by the users transmit powers {q k }- 

The transmitter parameters {v^}, {%}, tt achieving the boundary of Cb c (5; H) can be determined by 
solving the Weighted Rate Sum Maximization (WSRM) problem 

K 

maximize W k R k 

k=l 

subject to R G C hc (S; H) (3) 

for some suitable choice of the nonnegative weights {Wfc}- Although a direct solution of ^ is generally 
difficult, for the special case where the constraint set S is defined by linear inequalities 

tr(E x **)< 7/ , £ = 1,...,L, (4) 
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where {*&£} are. positive semidefinite symmetric matrices and {7^} are non-negative coefficients, the 
solution of ([3]) can be computed efficiently by solving a sequence of convex problems. 

By the Heine-Borel theorem, the compactness of S implies that S is bounded with respect to the 
Frobenius norm. Hence, without loss of generality, we can always include an additional sum-power 
constraint tr(S x ) < P for some sufficiently large P, without modifying the problem. This corresponds 
to including an additional constraint with 70 = P and 3>o = I in the set of constraints (Q. Important 
special cases of the constraint Q, beyond the sum-power constraint already discussed (3j-|[6), include 
the per-antenna power constraint and the per-group of antennas constraint |8j. For rank-1 = c^c^, we 
have a general "interference" constraint where the vector eg denotes a forbidden direction along which 
the transmit power must be not larger than 7^ (9J. 

Linear beamforming is a suboptimal precoding strategy that provides a low-complexity alternative to 
DPC. When combined with Gaussian random coding, the following region is achievable 

< C («S;H) = coh |J (jfr<log(l+ ' hgv fe., 8/ , V V 4 



(5) 



The optimization of the transmitter parameters {v k },{qk} is generally more difficult than with DPC 
since the WSRM problem with linear beamforming has no general convex programming equivalent. In 
this work we focus on the popular Zero-Forcing Beamforming (ZFBF) for at least three good reasons: 



1) in the regime of high SNR and/or if combined with user selection |10|, |11| it yields near-optimal 



performance; 2) its performance is well-understood and extensively analyzed also in the case of non-ideal 



channel state information [12]-|14|, and therefore it is practically relevant for applications; 3) it lends 
itself to a computationally efficient solution. 

The WSRM problem with ZFBF and general linear constraints is given by 



K 



maximize ^ W k log ( 1 + \h"v k \ 2 q k 

k=l 

subject to hj v fc = V j 7^ k 

tr(£^)<7^ W (6) 

Without loss of fundamental generality we consider the case where H has rank K with K < M, otherwise 



the problem is infeasible. If K > M, some user selection algorithm such as those proposed in [10], [11 1 
takes care of selecting an "active user subset" of size not larger than M. In fact in this paper, |10| is 
combined with the algorithm we develop to solve (|6]). 
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In the rest of this paper we consider problems Q and ([6]) subject to general linear constraints. Although 
these problems have been addressed in a number of recent papers, a thorough comparison of the various 
algorithms and a unified presentation has been missing. As far as problem ([3]) is concerned, we show the 
equivalence of the "SINR-duality" (9J and the "min-max duality" ||8j approaches, and give the computation 
details of the infeasible start Newton iteration algorithm for general linear constraints (only sketched in 
(8j for the case of per-antenna power constraint). Also, we show through simulation examples that this 
algorithm converges much faster and it is generally less complex than the inner-outer iterative algorithm 
based on Lagrangian duality and sub-gradient search advocated in [9]. As far as problem Q, we generalize 
the convex relaxation approach of fl3| to the case of linear constraints and arbitrary rate weights (not 
immediately obvious from p"5|). We solve the convex relaxation problem using a novel gradient descent 
algorithm with logarithmic barrier. Also, we propose a novel two-step iterative algorithm that updates 
directly the steering vectors of the ZF precoder, building on the form of generalized inverses. This new 
method is significantly more computationally efficient than the approach based on convex relaxation since 
it avoids the dimensionality expansion of convex relaxation. Finally, we use the proposed optimization 
algorithms in a multi-cell ICI mitigation scheme, where each cell optimizes its transmit covariance matrix 
by taking into account an interference constraint on the edge users of the adjacent cell. In a simple linear 
two-cell scenario, it is showed by Monte Carlo simulation that this approach can effectively improve the 
rate of the edge users and achieves generally better rates than simpler conventional schemes such as FFR. 

II. WSRM ALGORITHMS FOR DPC 

Consider the problem ([3]) where S is a compact convex set defined by the linear constraints Q, 
including a (possibly irrelevant) sum-power constraint, as said before. Without loss of generality, assume 
W\ > • • • > Wk > 0. In [9], using a technique called "Signal-to-Interference plus Noise Ratio (SINR) 
duality", the following fundamental results are proved. Define the "dual MAC" corresponding to ([T]) as 
the multiple-access Gaussian channel 

K 

y = ^2 hkXk + z ^ 

k=l 

where y,z £ C M , z ~ £A/"(0, ~E Z (\)) with X! 2 (A) = ^2d = Q ^£^t f° r some vector of non-negative 
auxiliary variables A > and each transmitter has power constraint E[|xfc| 2 ] < p^, subject to a total 
sum-power constraint 

K L 

^Pk<^\ae (8) 

k=l 1=0 
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Then, for any A > 0, the value of the original MISO-BC WSRM problem is upperbounded by the value 
of the new MAC WSRM problem 

K 

maximize WkRk 
fe=l 

subject to Re C mac (9) 

where C mac denotes the capacity region of the dual MAC defined above for given parameters A, {&e} and 
{7^}. Furthermore, the upperbound provided by the dual MAC is tight. Letting the value of the problem 
j9), for given A, be denoted by g(X), then the weighted rate sum of the MISO-BC can be obtained by 
minimizing g(X) with respect to A > 0. Hence, the sought solution can be obtained by iterating between 
one "outer problem", for the minimization of g(X), and an "inner problem", that solves ^ for fixed A. 

Owing to the polymatroid structure of the capacity region of the Gaussian MAC |16J, the solution of 
([9]) is found at the vertex of the MAC capacity region dominant face corresponding to the successive 
decoding order K , K — 1, . . . , 1. Then, g{X) is obtained by solving 



K 

max > Wh lo 
P >o ' 
fe=l 



L K L 

subject to £*(A) =J^A^, ^Pk < J^A^. (10) 



e=o k=i e=o 



For fixed A, (10) can be solved using the Lagrangian duality approach of 1 17 1, as done in p8| . 

As far as the the minimization of g(X) is concerned, this can be obtained the sub-gradient update of 
the auxiliary variables in the form A(n + 1) = A(n) — e n s(A(n)), where X(n) denotes the current value 
of A at iteration n, s(A(n)) is a subgradient of g(X) at A = A(n), and e n = £0^+5 is the adaptation 
step, for some suitable parameters eo, b > 0. A subgradient for this problem is given by the vector with 
components (9j 

atW = 7*-tr(E x (A)**), 

where ^^(A) denotes the transmit covariance matrix of the MISO-BC corresponding to the dual MAC 
at given A. The calculation of the subgradient requires to map the dual MAC solution {^(A)}, {wfc(A)} 
into the corresponding solution (powers and steering vectors) of the MISO-BC in order to determine 
T, X (X) = J2k=i yr k{X)^(X)qk(X). This is obtained by the well-known "MAC-to-BC" transformations 
0. 

An alternative approach to the solution of problem ([3]) can be obtained by extending the "min-max 
duality" approach of (8j to the case of general linear constraints and arbitrary rate weights. Consider the 
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downlink power minimization problem with SINR constraints and general linear constraints: 

minimize P 

subject to SINRf>%, \/k 



tr fe w * w k**J ^7/, W (11) 



where = y^Vfc denotes the unnormalized downlink beamforming vectors, the downlink SINR for 
user k is given by 

SINR? 



|h£wfcl 



l + E^ fc |h fc H w, 

and rjk denotes the SINR target for user k. 



J 



Theorem 1: The downlink beamforming problem (111 has the following Lagrangian dual form which 
is equivalent to a dual uplink problem with the same SINR constraints and under a worst-case noise 
condition: 

K L 

max min V" p k - V" X e -y e 
A>o P>o,{w fc} ^ ^ 

»|w"h fc | 2 

SUbjeCtt ° w"S' fA)w, + r D -lw"h-P 

S' 2 (A) = I + J]A^ (12) 

where and {A^} are the dual variables for the SINR constraint and general linear constraint, 

respectively. 

Proof: See Appendix [A] ■ 

Notice that the sum power in the corresponding dual MISO-BC (downlink) is not explicitly constrained. 
In contrast, the objective function is modified by a discount factor that includes the "noise" variables 
Ai, . . . , Al- At the optimal point, the MISO-BC sum power is given by P* = Yjk=iP*k ~ J2e=i ^tll- 
Thus, P* is generally smaller than the dual-MAC sum power ^2^ = iP* k - This suggests that for a given 
total power budget P* (fixed), we may need to reduce the downlink transmit power in order to satisfy the 
linear constraints. The same duality holds if we consider DPC successive encoding in some given order 
(say: it = (ttx, . . . , ttr)) and successive interference cancelation in the dual-MAC reverse order (say: ttk 
decoded first and tt\ decoded last). Since the capacity region of the multiuser MISO downlink channel 
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is obtained as convex hull of the union of DPC-achievable regions over all possible transmit covariances 
satisfying a general convex constraint (see Q), and since the SINRs for each rate point of such regions 
are also achievable in the dual MAC, we conclude that the capacity region of the downlink subject to the 
general linear constraints coincides with the capacity region of a virtual dual MAC with worst-case noise 
covariance, where the co variance matrix is parameterized by E^(A) in the specific form given above. 

Consider now the downlink WSRM problem ([3]) where S is defined by general linear constraints Q, 
including the sum-power constraint tr(S x ) < P. Letting again, without loss of generality, the weights be 
ordered such that W\ > ■ ■ ■ > Wk, the resulting min-max dual MAC problem is given by: 



K 



mm max 
A>o p>° 



k=l 



subject to S^(A) = I + ^ 

1=1 

K L 



(13) 



k=l 



By comparing ( 13 1 with ( 10 1 and recalling that the value g(X) of ( 10) must be minimized with respect to 
A > 0, we notice that the only difference between the two formulations is the presence of the auxiliary 



variable Aq in ( 10), related to the sum-power constraint. However, it is immediate to see that the solution 



of (10 1 is invariant to a common scaling of the vector of auxiliary variables A since this would affect 
in the same way both the noise covariance and the signal power constraint. Without loss of generality, 



letting Ao = 1 in ( [TO] ) yields the same optimization problem as (13 1. 

Therefore, the infeasible start Newton method of 1 8 ] can be used as an alternative to the inner (Lagrange 
duality) - outer (subgradient) iterative algorithm advocated in (91. Since this algorithm is only sketched 
in (8] for the case of per-antenna power constraint, and several computation steps are left to the reader, 
we give the details here for the case of general linear constraints. First, we define the modified objective 



function for ( 13 ) 



K 


L k 




L 


/ t (p,A) = J]A fc log 




- W x log 


i + ^2 X t®t 


k=l 


1=1 j=l 




1=1 




(14) 

where t > is a parameter that controls a "logarithmic barrier" term in order to prevent the iterative 
algorithm to approach the boundaries where some elements in p or in A may become zero or negative 
and where we define = — W^+i with Wk+i = 0. The logarithmic barrier guarantees that the 
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optimal value of the problem can be approached with gap t L . Along the iterations, the value t shall 
be increased in order to make this gap as small as desired. 

The problem is convex with respect to A and concave with respect to p, with Lagrangian function 



(neglecting the non-negativity constraints and using the modified objective function (14)) given by 

£(p, A, n) = f t (p, A) - fi (l T p - P - 7 t a) (15) 
with 7 = (71, . . . ,7l) T and p = (pi, . . . ,Pk) J ■ The necessary and sufficient conditions for optimality 



are given by the KKT conditions [19]: 

ri 

f"3 



a/*(p,A) 

dp 

dft(p,X) 
d\ 

P + 7 T A- l T p = 



fil = 



+ fij = 



(16) 



The vector r = (rj, rj, rs) T of dimension K + L + 1 is the so-called "residual" of the KKT equations. 
The algorithm finds a direction and a step for updating the variables (p, A, n) > such that, as the 
number of iterations grows, the norm of the residual tends to zero. The updating direction is given by 
d = — (Vr) -1 r, where Vr is the KKT matrix, given by 



Vr 



grj dri _-i 

dr 2 dr 2 



-1 T 7 T 







(17) 



Letting for simplicity the vector of variables be denoted by x = (p T ,A T ,/i) T , the algorithm takes the 
following form: 

1) Fix the algorithm parameters v > 1, and 6 > 0. Initialize x(0) to some positive values and let 
n = 0, and t = 1. 

2) Compute the updating direction d(n) at x(n). (see explicit expressions of the derivatives given later 
on). 

3) Update x(n + 1) = x(n) + sd(n) where s is found by backtracking line search: initialize s = 1 
and find s such as, while 



|r(x(n) + sd(n))|| > (1 — as) ||r(x(n) 



then s <— /3s, where (3 G (0, 1) and a G (0, 1/2) are fixed constants, (typical values are a = 0.3 
and (3 = 0.8). 
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4) If ||r(x(n + 1))|| < 5, move to the next step, otherwise set n <— n + 1 and go back to step 2. 

5) If < 5, exit and accept the value of x(n + 1) as the final value, otherwise set t <— vt and 
n <— n + 1 and go back to step 2. 

Explicit expressions for the elements of the KKT matrix Vr can be obtained using matrix calculus (see 
for example |20| and references therein). Letting ^ 

K 



-i 



, we find 



6 



dp* 

dr 2 



fc=max{i ,j} 
K 



5j 
tp 



'■L 

2 



k=i 
A 



<9r 2 



A fc tr (^fc^-^fc^) + Witv (* *j*o*i 



fc=i 



a? 



It should be noticed that the above terms are particularly easily computed in the relevant case where the 
constraint matrices have rank 1, i.e., for <1>^ = c^ct 1 , as in the case of interference direction constraints 



(see example in Section [TV] ). 

Figs. [T] and [2] show an example of convergence of the inner-outer iterative algorithm and infeasible start 
Newton algorithm with the same system parameters with M = 4 antennas, K = 3 users, unit weights for 
all users {Wk = 1) and L = 2 forbidden interference directions. In order to allow independent replication 
of our numerical experiments, the channel and interference direction vectors are provided in Table. [I] 
The sum power constraint is set equal to 10 and the two interference constraints are set equal to 5. The 
evolution along the algorithm iterations of the objective function (sum rate) and of the sum-power and 
interference values are shown. As the figures reveal, both algorithms converge to the same optimal values 
and satisfy the given sum-power and interference power constraints with equality. However the workload 
required to achieve sufficient convergence is different for each algorithm. The number of M x M matrix 
multiplications per iteration captures to first order the overall algorithms complexity, as the workload of 
the remaining operations is significantly lower. The infeasible start Newton algorithm has slightly higher 
complexity per iteration, 0(K{K+L 2 )), than the complexity of the inner-outer iterative algorithm, which 
is 0(K 2 ). However, this difference is almost negligible in the relevant case where K is significantly larger 
than Furthermore, as seen from Figs. [l]and|2j the Newton algorithm requires a significantly smaller 
number of iterations to converge. Therefore, the Newton algorithm has a clear advantage in the case 
K 3> L. For example, using a MATLAB implementation on a Intel Core 2 Windows XP machine, for 



'Here <5;,j denotes Kronecker's delta, equal to 1 if i = j and to otherwise. 

2 Typically, the number of users per cell K is much larger than the number of constrained interference direction L. 
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TABLE I 

Example of channel and interference direction vectors 



hi 


h 2 


h 3 


Cl 


C2 


-0.70 + 0.82i 


0.20 - l.lOi 


0.30- 0.22i 


-0.83 + 0.81* 


-0.53 + 0.44i 


0.09 + 0.11 


-0.70 + 0.90i 


-0.50- 0.65i 


0.78 + 0.87i 


1.33 - 0.26i 


1.15 + 0.04i 


0.42 - 0.5H 


0.87 - 0.76i 


0.45 - 0.45i 


0.27 + 0.39i 


-0.95 + 0.77i 


1.00-0.18i 


-0.77- 1.13i 


0.78 + 0.55i 


-0.83 - 0.87i 



the snapshots of Figs. [T] and [2] the infeasible start Newton algorithm run-time is 58 ms while the inner- 
outer iterative algorithm run-time is 197 ms (about 3 times slower). A similar advantage was noticed in 
a large number of Monte Carlo experiments (not reported here for because of space limitations) with 
randomly generated channel vectors. 

III. Novel WSRM algorithms for ZFBF 

The WSRM problem with ZFBF ([6]) can be reformulated in terms of unnormalized transmit matrices 
(i.e., including the transmit powers) as 

K 



maximize ^ Wk log ^1 + h£ T^h* 

fc=i 

subject to h^Tfchj =0 Vj ^ k 

tr fe T **<J W 

T k h 0, rank(T fc ) = 1, \/k (18) 
Problem ([T8]) is not convex due to the rank-1 constraint. A convex relaxation of the original problem is 



obtained by removing the rank-1 constraint. In [15] the problem is solved for the equal-weight case and 



per-antenna constraint and it is shown that the convex relaxation problem has always a rank-1 solution. 
Following in the footsteps, it is easy to show that the same holds for the general case ( [T8| ). In particular, 
letting {T£} denote a solution of the convex relaxation problem with possibly rank(T|) > 1 for some k, a 
rank-1 solution = t^t^ 1 achieving the same optimal value can be determined by finding, independently 
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for each k, the vector t& solution of: 



maximize h^t^ 



subject to h^t fe G IR+ 



h"t fc = 0, Vj^k 



trft fc tj;* £ ) <tr(T^), V£ (19) 



We notice that ( 19 1 is a Second-Order Cone Program (SOCP) |21 1 and can be easily solved by standard 



tools (e.g., see |22|). In the special case of per-antenna constraints, treated in fT5[ , ( [19] ) reduces to a 
linear program. 

Two main issues arise from the convex relaxation approach: 1) A dramatic dimensionality increase: the 
relaxed problem deals with K symmetric matrices of dimension M x M, that is, with KM(M — l)/2 = 
0{KM 2 ) variables, in contrast with the KM original variables; 2) Lack of an efficient computational 



method: in |15| the relaxed problem for equal weights takes on the form of a "MAXDET" |23J for which 
efficient solvers exist. Unfortunately, for general weights, the problem is not MAXDET and general- 
purpose convex optimizers must be used, with consequent increase of the computation burden. In the 
following we address both issues. First, we consider a dimensionality reduction of the original problem by 
eliminating the zero-forcing constraints. Then, we propose a gradient descent algorithm with logarithmic 
barrier that converged directly to the solution of the dimension-reduced convex relaxation. Finally, we 
build on the structure of generalized inverses fl3t and find a low-complexity two-step iterative algorithm 
where the powers and steering vectors are alternatively updated. The low-complexity algorithm may 
converge to a local maximum, but if this is combined with a few gradient descent steps the optimal 
solution can be approached with very high probability and with a dramatic complexity reduction. 



A. Gradient descent algorithm with barrier functions 

We start by reducing the dimensionality of ( [T8) ) by eliminating the ZF constraints. The condition 
h^Tfchj = for all j ^ k together with the fact that T k y and rank(T fc ) = 1 yields that 

T k = U fc a fc aM (20) 

where G <£ Mx ( M ~ K + 1 ) j s a unitary basis for the orthogonal complement of the subspace Spanjhj : 
j k}. Consider the SVD of H in "compact form," i.e., H = USV H with unitary U, V of dimensions 
M x K and K x K, respectively, and let U -1 be a unitary matrix of dimension M x (M — K) such 
that [UjU 1 ^] is a unitary basis for C M . In particular, U -1 is a basis for the orthogonal complement of 
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Spanjhi • • • hx}- The Moore-Penrose pseudoinverse of H H is defined by 

H + = H(H H H) _1 = US _1 V H . (21) 

It follows that the k-th column of H + is a linear combination of the columns of U and, in addition, 
it is orthogonal to all hj for j ^ k. Hence, the /c-th normalized column of H + , denoted by in the 
following, is a unit-norm vector in the orthogonal complement of Spanjhj : j 7^ k}. Since g k is a linear 
combination of the columns of U, then it is also orthogonal to all columns of U . Hence, the desired 
matrix XJ k can be obtained in the form 

U fc = [gfelU 1 ]. 

Notice that h^T k h. k = \^{\J k a. k aL^XJ^)Yi k = Ig^h^pfA/jJi i, where we define the rank-1 matrices 
Afc = aLk^k- Letting d k = |g^hfc| 2 and 3>£ 5 fc = U^^Ufc for all k and I, the dimensionality -reduced 



problem corresponding to ( 18 1 can be written as 

K 



maximize W k log (1 + d k [A k ] lt i) 

k=l 

subject to tr A fc * £>fc ^ < j e , W, 



A k y 0, rank(A fc ) = 1, Vfc. (22) 

Again, a convex relaxation of the above problem is obtained by removing the rank-1 constraints. For the 
convex relaxation, by including all constraints into logarithmic barrier functions, we obtain the modified 
objective function 

K 

/t(Ai,.-. ,A K ) = J2w k \og(l + d h [A k ]i,x) 

k=l 

+ \ l0g ~ tf fe A fc*^>^ J + l0g dCt ( Ak ^j (23) 

where t > is the logarithmic barrier control parameter, that guarantees that the optimal value of the 
problem can be approached with gap K ~f . The problem is concave with respect to Ai, • • • , Ak in the 



domain dom f t = {(Ai, • • • , Ak) : tx(Y^,f=i A?^j) — lii ^k t: 0, W, k}. We maximize (23) by 
applying the iterative gradient descent method for given t, and increase the parameter t after a sufficient 
number of iterations. Since A k is a Hermitian matrix with M — K + 1 real variables on the diagonal and 
(M - K){M - K + l)/2 complex variables off-diagonal, the problem has a total of K(M - K + l) 2 
real variables. For K = M, this represents a very significant dimensionality reduction with respect to 



the original convex relaxation of [15] (from cubic to linear in the number of antennas). However, in 
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applications involving user selection 1 10 1, |11|, [24| or when the number of antennas is significantly 
larger than the number of users, the complexity of the above method is still significant. In addition, we 
observed a very slow convergence (see examples later on). Therefore, we will explore a lower-complexity 
iterative method in the next section. 

For the sake of completeness, we conclude this section by giving explicitly the details of the gradient 



method. The partial derivatives of (23) with respect to each element of are given by: 



dft 



W k d k 



l+h"A fc h fc * 
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\m,m I i 



d [A k ] e e 



dft 
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(24) 
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Vm ^ n 



0/t 



5Re([A fc ] njm )' 
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(25) 
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dIm([A k ] n ,m) 

The update direction matrix for A k is denoted by = VaJi, with elements given by 

— , Vm = n 



(26) 



ID 



k\m,n 



d[A k 



dft 



9Re([A fe ] m ,„) +-?'<9Im([A fc ] m ,„)- 

At the n-th iteration of the gradient descent algorithm, the fcfh matrix is updated as A k (n + 1) = 
Afc(n) + sDfc(n) where the step size s is determined according to a standard backtracking line search: 
initialize s = 1 and update s <— j3s while 

K 

/ t (Ai(n+l),--. ,Ajr(n + l)) < / t (Ai(n),-.- , A K (n)) + as^^ |[D fe (n)] i;i | 2 

fc=l i>j 

or 

(Ai(n + 1),-- - ,Ajf(n + l)) £ dom/(, 

where /3 G (0, 1) and a G (0, 1/2) are fixed constants. For the given control parameter t, the matrices 
A k (n),Vk are updated until the following stopping criterion is satisfied for the convergence of the 



dft 



-, Vm 7^ n 



(27) 
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objective function in (23): 




\k=l i>j 

When the stopping criterion is satisfied, t is updated as t = vt for v > 1 and a new gradient descent 
iteration starts with new t. The solution can be mapped into a rank-1 equivalent solution by letting 
T£ = UfcA^Uj" 1 , where {A£} denotes the optimal point found by the gradient descent, and then finding 



the optimal steering vectors {t^} via (19 1 for each k. 

Some terms in the logarithmic barrier function may approach the negative infinity as the iterations 
proceed. Hence, the algorithm parameters must be designed very conservatively, allowing a very small 
step size at each iteration. For this reason, the gradient descent algorithm converges very slowly. Fig. 
[3] illustrates the convergence behavior of the gradient descent algorithm. The channel and constraint 
parameters including the channel and interference direction vectors are the same as in Figs. [T] and [2] The 
sum rate converges to the optimal values and the given sum-power and interference power approaches 
the given constraints with equality, but convergence is very slow. 

B. Two-step power and steering vector update algorithm 

We consider a new algorithm that builds on the structure of generalized inverses, and updates directly the 
steering vectors rather than working with the convex relaxation problem. In this way, the dimensionality 
of the problem is not expanded, but the possibility of converging to a local maximum exists. This problem 
will be addressed at the end of this section. In general, the zero-forcing constraint implies that the matrix 
T = [ti, . . . , tx\ of unnormalized steering vectors must be a right generalized inverse fl5| of the matrix 
H H , i.e., it can be expressed in the form 

T = [giai,...,gK-ajf] + XJ ± [b 1 ,...,h K ] 

= G + U^B (28) 



where {gfc} are the normalized columns of the Moore-Penrose pseudo-inverse (21 1 and U is a unitary 
basis of the orthogonal complement of Spanjhi • • • hx}, as defined before, where a = (ai, . . . , ax) T are 
scalar coefficients, and B is a matrix of size (M — K) x K. We seek to directly optimize the coefficients 
a and B by iterating two steps: 1) for fixed (normalized) steering vectors, optimize the power allocation; 
2) for fixed power ratios (relative powers) on the pseudo-inverse {gA,} directions, maximize a common 
scaling factor by optimizing the steering vectors. 
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Step 1. Initialize the steering vectors by letting t k = gfe, corresponding to a = 1 and B = 0. The 
ZFBF power allocation problem for fixed (not necessarily unit-norm) steering vectors is given by: 



K 



maximize ^ W k log(l + \h" t k \ 2 q k ) 



k=l 
K 

subject to: ^q k t^(t k < j e , W 

k=l 

q > (29) 

Defining the L x K matrix C with (£, k) element [C]^ = ^t^&et k , the constraint can be compactly 
written as Cq < 1. The Lagrangian for ( [29] ) is 

K 

£(q, A) = J2 W k log(l + |h" t k \ 2 q k ) - A T (Cq - 1) (30) 

k=l 

where A > is a vector of dual variables. The KKT conditions for q k yield the waterfilling-like solution 

W k 1 



QkW 



(31) 



A T C fc |hH tfc |2_ 

where c k is the k-th column of C. Using this into £(q, A), we can solve the dual problem by minimizing 
£(q(A), A) with respect to A > 0. It is immediate to check that, for any A' > 0, 

£(q(A'),A') > £(q(A),A') 

= £(q(A),A) + (l-Cq(A)) T (A / -A) (32) 

Therefore, s(A) = (1 — Cq(A)) is a subgradient for £(q(A), A). It follows that the dual problem can be 
solved by a simple L-dimensional subgradient iteration. 

Step 2. Let q denote the output of Step 1 for fixed steering vectors {t^}. In this step we fix a with 
components a k = y^g^t^, and search for the steering vectors that maximize a common power scaling 



factor rj. Using (28 1 we obtain the optimization problem 

maximize rj 



It 

with solution readily given by 



2 

subject to: — tr (tT h */) < 1 V I (33) 



max £=li ..., L A /^tr(TT H * 



where T is calculated as in (28 1, for the fixed coefficients a and for B solution of 



minimizes u u 



subject to: J — tr (TT H ^) < u V i (34) 

It 
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It is recognized that (34) is a SOCP with respect to the variables u and B, and can be solved by standard 
efficient tools (e.g., see (22|). 

The output of Step 2 is a new set of steering vectors in the form tjt = rj[g k ak + U^bfc]. These can be 
used as new fixed steering vectors for Step 1 , and the iterative algorithm can go on. Notice that, with the 
initialization t& = for all k, at the first round of Step 1 the algorithm obtains the optimal weighted rate 
sum achievable by the pseudo-inverse steering vectors. Hence, we are guaranteed to find a generalized 
inverse that performs at least as well (and usually improves upon) the pseudo-inverse, already after one 
iteration. Fig. [4] shows an example for the two-step algorithm under the same conditions of Fig. [3] In 
this case, the objective function (sum rate) and the sum-power and interference power converge to the 
same optimal values as in Fig. |3j but the convergence of the two-step algorithm is significantly faster. 

In general the two-step algorithm may converge to a local maximum since the problem at hand is 
non-convex. We investigated this effect by randomly generating a large number of channel matrices with 
i.i.d. elements ~ CAf(0, 1) and, for given linear constraints, we calculated the optimal rate sum obtained 
using the gradient algorithm and the value achieved by the two-step algorithm. We assumed M = 4 
antennas, K = 3 users, unit weights for all users, sum-power constraint equal to 10 and two interference 
constraints with random directions and constraint equal to 5. Fig. [5] shows the cumulative distribution 
function (CDF) of the ratio between the value of the two-step algorithm divided by the corresponding 
optimal value obtained via the gradient algorithm. For example, considering the solid line in Fig. [5] 
we notice that the two-step algorithm achieves a sum-rate value 5% less than the optimal with about 
10% probability. In order to improve the performance of the two-step algorithm, we can use a different 
initialization point. Since the gradient method is guaranteed to converge to the optimal point, a sensible 
choice consists of performing a limited number of (costly) gradient iterations, and then switching to 
the (computationally efficient) two-step algorithm. This approach is meaningful if the feasible point in 
the convex relaxation problem obtained by the gradient descent algorithm after an arbitrary number of 
iterations can be mapped into a feasible point for the two-step algorithm, without decreasing the value 
of the objective function. This is guaranteed by the following result: 



Theorem 2: Any feasible set of matrices { A&} of the convex relaxation of problem (22 ) (not necessarily 
of rank-1) can be mapped into a set of feasible zero-forcing steering vectors {t^} without decreasing the 
value of the weighted rate sum objective function. This is obtained by solving for each k a SOCP given 



by <|19}, with T* replaced by T k = U k A k U%. 

Proof: See Appendix [B] ■ 

By Theorem |2j a feasible point obtained after N iterations of the gradient descent algorithm can be 
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mapped into the initial (feasible) point for the two-step algorithm. For a sufficiently large number of 
gradient iterations N, the obtained initial point are "closer" to the optimal point, reducing the probability 
that the two-step algorithm gets trapped into a local maximum. Fig. [5] shows the CDF of the ratio 
(as defined before) when N = 10 and N = 100. We observe that even with a small number (e.g., 10) 
gradient iterations, the probability that the two-step algorithm achieves a value very close to the maximum 
improves significantly. This, of course, comes at an enormous saving in complexity with respect to using 
the gradient method till convergence. 

IV. Interference coordination in a multi-cell scenario 

In cellular wireless communications, the average received signal power is a polynomially decreasing 
function of the distance between transmitter and receiver. Users close to the edge of their cell experience 
relatively weak desired received signal power and strong ICI power and therefore have very poor SINR. 



In conventional cellular design [25], [26], the system capacity is essentially determined by the worst-case 



"edge" users and ICI is mitigated by some fixed allocation of the downlink transmit power to frequency 



bands, ranging from the conventional frequency reuse [25], |26| to the "Fractional Frequency Reuse" 
(FFR) schemes advocated in some recent system standardization [27|. Such strategies are "static" in 
the sense that they do not exploit the instantaneous knowledge of the interfered users' channel vectors. 
The problem of edge users can be alleviated by introducing differentiated rate services and scheduling. 



For example, data-oriented high-rate downlink schemes such as EV-DO [28] and HSDPA [29] consider 



Proportional Fair Scheduling (PFS) |30|. 



As an application of the transmitter optimization techniques developed before, in this section we 
consider a "partial cell coordination" approach, where each base-station is aware of the interfering 
channel coefficients to users in adjacent cells, and optimizes its transmitter covariance matrix subject 
to an interference threshold constraint to one or more edge users in adjacent cells. This approach can 
be regarded as an intermediate solution between a high complexity fully coordinated network MIMO 



approach [31]-|34| and a conventional FFR approach. We refer to this approach as "active" ICI mitigation 
since it exploits the instantaneous MIMO channel state information. 

In order to fully appreciate the impact of ICI mitigation, the system performance must be evaluated 



under some suitable fairness criterion [28|-[30|. In fact, in a typical setting with K > M the cell sum- 
capacity may be maximized by serving only the users near the cell center, while allocating zero rate and 
power to the edge users. This would result in a very misleading result, since the edge users would suffer 
from an unacceptably poor quality of service. In our system simulation, we considered PFS and "Hard- 
Fairness Scheduling" (HFS), where the former aims at maximizing ^2k = ilogRk and the latter aims at 
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maximizing min^L 1 Rk, where denotes the long-term average rate of user k. The scheduling algorithms 
are obtained using the general framework of virtual queues and stochastic optimization presented in J35J 



and applied to the MIMO downlink scheduling problem as done in |36| where a detailed proof of 
optimality is also given based on the Lyapunov drift technique. 

A. System model 

We consider a simple downlink system formed by two mutually interfering cells, as shown in Fig. [6] 
Base-stations are placed in position — D and D and serve K users uniformly distributed on the intervals 
[— D, 0] and [0, D], respectively. In each cell, the users are indexed such that user k = 1 is the closest to 
the base-station and user K is at the edge of the cell. We assume a distance-dependent path gain given 
by G(d) = Go/(l + (d/S) a ), where d denotes the distance between the transmitter and receiver, a is the 
pathloss exponent, 5 is the "3dB" breakpoint distance, and Go is a constant that determines the channel 
gain at the cell center. A frequency-flat block-fading channel is assumed, such that at each slot time t 
the channel vectors are fixed in time for the whole slot duration of T channel uses, and then change 
from slot to slot according to some ergodic process. The channel vectors have zero-mean i.i.d. Gaussian 
coefficients (independent Rayleigh fading), both in space (across antennas) and across the users. The 
received signal for user k = 1, . . . , K in cell n = 1,2 at slot time t is given by 

Vk,n{t) = hjj n (t) \J2^(t)u^ n (t)\ +cjj n ,(t) [^v, x (t)n^(t)j + %n (t) (35) 



from the desired cell from the interfering cell 

where n' = 1 if n = 2 and ml = 2 if n = 1 denotes the neighbor cell index, hfc jn (i) and Cfc „/(t) are the 
channel vectors from the desired cell to user k antenna and from the interfering cell to user k antenna, 
respectively, and where ~ CN(0, 1) denotes a unit- variance AWGN. As before, {vj jT1 (i), Uj tTl (t) : 

j = 1, . . . , K} and {vj jn '(t), Uj >n >(t) : j = 1, . . . , K} denote the steering vectors and the coded symbols 
transmitted by base-station n and n' to their own sets of users, where the dependence on the slot time t 
is explicitly evidenced. As described earlier, the vectors hfe in (t) have i.i.d. CA/"(0, G(dfc >n )) coefficients 
and the vectors Cfc jn /(i) have i.i.d. CJ\f(0, G(sk, n ')) coefficients, where c4 >n denotes the distance between 
user k in cell n and its desired base-station, and Sk t n' denotes the distance between user k in cell n and 
the interfering base-station. 

Consider cell n (the same operation takes place, independently and symmetrically, in cell n'). At 
each slot time t, given the knowledge of the desired user channels {hj n (t) : j = 1, . . . , K} and of 
the interference directions to the adjacent users {cj n (t) : j = 1,...,K}, and given the scheduling 
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algorithm that determines the weights {Wk, n (t)}, the transmitter in cell n optimizes its steering vectors 
and transmit powers by solving ([3]) or ([6]), depending on whether DPC or ZFBF is considered, subject 
to the constraints: 

tr (S x ) < P, sum-power 
cjj n (t)E x CK,n{t) < e, ICI to the edge user 
Following the intuition gained by the recent results on the Gaussian interference channel (371, the ICI 
power threshold e is set equal to the noise level (i.e., equal to 1 in this case), so that the presence of ICI 
degrades the edge users' SINR by at most 3 dB. The user rates resulting from solving ([3]) or ([6]) under 



(36 1 are used to update the scheduling algorithm and to compute the weights to be used in the next slot. 



The details of the scheduler equations are omitted for the sake of space limitation and can be found in 



By construction, the ICI power for user K is not larger than e. For all other users k ^ K, since 
i c k,n'(t) '■ k < K} are independent of CK,n'(P) an d of {hj,n'(£) : j = 1, • ■ • , K}, the average interference 
power is given by 



E 



K 

J2\ C kMt)Vj,n'(t)u j>n ,(t)\ 
3=1 



G{s Kn ,)P 



It follows that the noise plus interference power at user k receiver in cell n is given by 

f 1 + e for k = K 

N kn = { (37) 
[ l + G(s k>n ,)P for k + K 

These different "equivalent noise" variances can be incorporated as factors in the channel vectors, in order 

to solve the WSRM problem in the same form as given in the previous sections (details are omitted for 

brevity). In our simulations for ZFBF, we use the user selection algorithm that was proposed, under the 

standard sum-power constraint, in |10J and extend it to the case of non-equal weights and ICI constraints 

considered here (details are omitted for the sake of brevity). 

B. Simulation results 

In the simulations of this section the cell radius is set to D = 1 km and the other system parameters 



follow the Mobile WiMAX performance evaluation specification [38]. Under PFS and HFS, the proposed 
interference coordination scheme is compared with an FFR interference mitigation scheme where the total 
system bandwidth is split into two equal subbands, and the base-stations' transmit power is allocated over 
the subbands such that cell 1 uses power 2Pp in the first subband and 2P(1 — p) in the second subband 
for < p < 1, and cell 2 uses the reverse allocation. The total base-station power per subband is equal to 
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P. However, with this arrangement, the edge users in cell n (where n = 1,2) can be served on the higher- 
power subband, and are interfered by the lower-power subband of the other cell. In the extreme case of 
p = 0, the FFR scheme reduces to classical reuse-2 and for p = 1/2 we have a reuse- 1 system. With 
FFR, we run conventional DPC and ZFBF WSRM problems combined with the scheduling algorithms 
mentioned above, subject only to the sum-power constraints on each subband, with the knowledge of the 
average ICI power from the adjacent cell, but no instantaneous knowledge of the interference channels 
{ c k,n{t), Ck,n'(t)}- The details of MIMO downlink scheduling in a multi-cell scenario with FFR can be 
found in (34). 

We considered M = 4 transmit antennas per base-station, and K = 4 users per cell. In the case of 



active interference mitigation, the ICI constraint ( 36 ) is imposed, whereby the ICI threshold e is set equal 
to 1, i.e., equal to the noise power, as described before. Based on the typical settings of [38], the path 
gain parameters are given as a = 3.504, 5 = 0.036 km, and Go = —91.64 dB and the transmit power 
normalized by the noise power is given as P = 154 dB. Fig. [JJ shows the long-term average user rates 
in each of the two cells as a function of the user location for DPC under PFS. For the system where the 
precoder optimization is subject only to the sum-power constraint, and with frequency reuse 1, the edge 
user in each cell (i.e., user K, located close to 0) has very small average rate with respect to the center 
users. The edge users' rate is significantly improved by the proposed interference coordination, with a 
negligible decrease of the center user rates. Also FFR is able to increase the edge user rates with respect 
to the no-coordination reuse 1 case, but the improvement is less significant and comes at the expenses of 
a rate degradation for the center users. For example, for p = (frequency reuse 2), the edge user rates 
are increased with respect to the p = 1 (frequency reuse 1) case, but the center user rates are reduced 
by almost a factor of 2. As a representative example of the FFR performance, we show in the figure the 
case of p = 0.1. 

Fig. [8] shows analogous results for DPC under HFS. This scheduling scheme achieves indeed the equal 
long-term average rate point of the system (all users have the same long-term average rate). Therefore, the 
bottleneck effect of edge users is even more evident than with PFS. In this case, the proposed interference 
coordination scheme achieves the best user rates and the reuse-2 achieves the best rates over all values 
of p for the FFR case. 

Figs. [9] and 10 show the corresponding results for the case of ZFBF under PFS and HFS, respectively. 
Observation of these figures reveals trends very similar to those exhibited by DPC precoders in Figs. [JJ 
and GO 
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V. Concluding remarks 

We considered the transmitter optimization problem in a MISO broadcast channel subject to general 
linear constraints, under both the optimal DPC and the simpler linear ZFBF precoding schemes. In 
this work, we showed the equivalence of "SINR-duality" (9j and "min-max duality" |8j approaches, 
we provided the details of an infeasible start Newton iteration algorithm and we showed by extensive 
simulation that it converges significantly faster than the inner-outer iterative algorithm advocated in [9]. 
For the case of ZFBF, the convex relaxation approach of p"5| was extended to the case of general linear 
constraints and arbitrary rate weights. We proposed a novel gradient descent algorithm with logarithmic 
barrier and we addressed the problem of dimensionality reduction for the convex relaxation problem. 
Also, we proposed a novel two-step iterative algorithm that updates directly the steering vectors of the 
ZF precoder, building on the form of generalized inverses. Finally, we showed an application of these 
algorithms in a multi-cell ICI mitigation scheme, where each cell optimizes its transmit covariance matrix 
by taking into account an interference constraint on the edge users of the adjacent cell. 

After the submission of this manuscript, we became aware of two interesting related works that are 



worth mentioning. In 1 39 1, linear Zero-Forcing under general linear constraints is addressed for the general 
case of multi-antenna receivers (the full MIMO-BC) case. The approach of [39] ], particularized to the 
MISO case, yields a different algorithm that also avoids the dimensionality explosion of convex relaxation. 



An accurate complexity comparison between the algorithm of [39 ] and the one proposed in Section III-B 



would be of some interest. In [40], the A'-user Gaussian MISO interference channel subject to Gaussian 
coding, linear beamforming and treating interference as noise is studied, and a new parameterization of 
the Pareto-optimal boundary of the beamforming achievable region is obtained. In this formulation, every 
transmitter maximizes the rate to its own desired user subject to interference constraints to the other users. 
This approach is clearly related to our active interference coordination example. It is expected that by 
extending the approach of [40] to the case of broadcast with interference (where each base station serves 
many users, as in our case) a systematic way to set the interference threshold can be found, beyond the 
sensible "rule of thumb" that we used in our example. 
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Appendix A 
Proof of TheoremQ] 

The proof follows closely in the footsteps of [8], generalizing the per-antenna constraint to arbitrary 



linear constraints. The Lagrangian function of the power optimization problem ( fTl) is given by 

L 

+ 5> 



C(P,{w k },p,\) = P + X 



\fe=i / \ e=i I \k=i ) 



It 



K 

k=l 



|h " Wfc ' 2 -E|h fe %| 2 "l 



(38) 



where p = (pi, • • • ,Pk), A = (Ai, • • • , Al) are the dual variables for the SINR constraints and general 
linear constraints, respectively. We rewrite ([38]) as 



A 



£(P,{w fc },p,A) = ^2 Pk -^2x ae + p(i-X Q ) 



k=l 



A 



+ w, H ( A i + A ^ + 5>; h A H - ~ hkh " I wfc 



k=l 



?]k 



(39) 



(40) 



Letting E' Z (A) = Aol + Y^,t=i ^i&l* we obtain the Lagrangian dual objective function: 

£(p,A)= min C(P, {w fc }, p, A) 

•P,{w fc j 

It is obvious that Q = — oo if 1 — Ao < or the matrix 

3+k 

is not positive semidefinite. On the other hand, adding the constraint < Ao < 1 and the positive 
semidefmiteness, the dual problem takes on the equivalent form: 

K L 



maximize p k — Xg^g 



k=l 



subject to H' g (\) + h i h i Pj ^ — h fc h fc > Vfc 

V / % 

< Ao < 1, A > 0, p > 



(41) 



Notice that the solution with respect to Ao is trivially obtained by letting Ao = 1. Hence, we shall replace 
Ao by 1 in the following. As shown in (8] Lemma 1], the semidefinite constraints can be rewritten in 
terms of uplink SINRs as 



SINR]; 1 = p k h H k [ S^(A) + 5>Ahy ) h fc < m . 

3+k 
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where the k-th uplink SINR is the SINR at the output of a linear MMSE receiver defined by the 
beamforming vector 



A' 



k=l 



Finally, since the SINR constraints must be attained with equality, they can be reversed while turning 
the maximization with respect to A into a minimization, so that the dual problem is given in the desired 



form (12 1 



Appendix B 
Proof of Theorem[2] 



Let {At} be a solution of the convex relaxation of (22) (i.e., after neglecting the rank-1 constraints). 



This can be reformulated with respect to the auxiliary "slack" variables S as follows: 

K 

maximize y] Wk log (1 + /Xfc(S)) 



k=l 



A 



subject to a y 0, ^2[S]k,e < le, 

1 

where /ifc(E) is the solution of the auxiliary problem: 

MS) = 

subject to tr(A k &i) < [S] fe) , 



(42) 



fc=i 



max [A fe ]i i 



W 



(43) 



where are defined as in (22i. 



Problem (43) is a special case of the problem 

maximize u H Au 

subject to A y 0, tr(*^A) < rj£, Ml (44) 

where u, y 0} and i]£ > are given vector, matrices and constants, respectively. Let A* denote a 
solution of (44 1 and assume that the problem is bounded J^] Let a* be the vector solution to the problem 

maximize Re{u H a} 



subject to a SD^a < rje, V 



(45) 



In 1 15 Lemma 1] it is shown that |u a*| = u A*u. In turns, thanks to the chain of equivalent problems 



given above, this fact implies that a rank-1 solution for ( 18 1 can be found from the solution of the convex 



3 Since we always consider a sum-power constraint, in our case the problem is always bounded. 
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relaxation by using ( 19). Here, we are interested to show that any feasible point Tfc = U^A^U^ 1 of the 
convex relaxation problem can be mapped into a rank-1 feasible point without decreasing the value of 
the objective function (weighted rate sum). 



This is proved if we show that for any feasible point A of (44) there exists a vector a such that aa H 



is also feasible for (44 1 and |u H a*| 2 > u H Au. Furthermore, a can be found by solving a SOCP of the 



type of ( |45) 
Define 



u H Au 

a = -rr- < 1 (46) 

u H A*u - v ' 



and let a = ^/aa*. Then, we obtain 

|u H a| 2 = a|u H a*| 2 = u H Au, 

and 

a H \E^a = aa* H \E^a* < rjg. 

Hence a is a feasible vector achieving the same objective function of A (this shows existence). Now, 
consider the SOCP 

maximize Re{u H a} 

subject to a H ^a < tr(*^A), V £ (47) 



Letting a denote the solution of ( 47 1, since by construction a is a feasible point of (47 1, we have 



|u H a| 2 > |u H a| 2 = u H Au which is what we wanted to show. 
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Fig. 1. Rate and power convergence behavior of inner-outer iterative algorithm for DPC with M — 4 and K = 3 under the 
sum transmit power and interference constraints with L — 2 forbidden directions. The dots on the "x" axis indicate when the 
outer subgradient iteration is activated. 




Fig. 2. Rate and power convergence behavior of infeasible start Newton algorithm for DPC under the same conditions of 
Fig. 00 
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Fig. 3. Rate and power convergence behavior of gradient descent algorithm for ZFBF under the same conditions of Fig 
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Fig. 4. Rate and power convergence behavior of two-step algorithm for ZFBF under the same conditions of Fig. [T] 
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Fig. 5. Cumulative distribution of the sum rate of two-step algorithm normalized by the optimal sum rate 
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Fig. 6. Two-cell multi-user MISO downlink system model. 
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Fig. 7. User rate of proportional fairness scheduling for DPC with M = 4 and K = 4. 
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Fig. 8. User rate of hard fairness scheduling for DPC with M = 4 and K = 4. 
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Fig. 9. User rate of proportional fairness scheduling for ZFBF with M = 4 and K — 4. 
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Fig. 10. User rate of hard fairness scheduling for ZFBF with M — 4 and K = 4. 



